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Abstract — In this paper, we present a belief propagation (BP) 
based algoritlim for decoding non-orthogonal space-time block 
codes (STBC) from cyclic division algebras (CDA) having large 
dunensions. The proposed approach involves message passing 
on Markov random field (MRF) representation of the STBC 
MIMO system. Adoption of BP approach to decode non-ortho- 
gonal STBCs of large dimensions has not been reported so far. 
Our simulation results show that the proposed BP-based decod- 
ing achieves increasingly closer to SISO AWGN performance for 
increased number of dimensions. In addition, it also achieves 
near-capacity turbo coded BER performance; for e.g., with BP 
decoding of 24 x 24 STBC from CDA using BPSK (i.e., 576 real 
dimensions) and rate- 1/2 turbo code (i.e., 12 bps/Hz spectral ef- 
ficiency), coded BER performance close to within just about 2.5 
dB from the theoretical MIMO capacity is achieved. 

Keywords — Non-orthogonal STBCs, large dimensions, low-comp- 
lexity decoding, belief propagation, Markov random fields, high spec- 
tral efficiencies. 

I. Introduction 
Use of multiple antennas at the transmitter can offer the ben- 
efits of transmit diversity (e.g., using space-time coding) and 
high data rates (e.g., using spatial multiplexing) f\\. MIMO 
systems that employ non-orthogonal space-time block codes 
(STBC) from cyclic division algebras (CDA) for arbitrary 
number of transmit antennas, Nt, are particularly attractive 
because they can simultaneously provide both full-rate (i.e., 
Nt complex symbols per channel use, which is same as in V- 
BLAST) as well as full transmit diversity ||2|,|l3|. The 2x2 
Golden code is a well known non-orthogonal STBC from 
CDA for 2 transmit antennas ID. High spectral efficiencies 
of the order of tens of bps/Hz can be achieved using large 
non-orthogonal STBCs. For e.g., a 16 x 16 STBC from CDA 
has 256 complex symbols in it with 512 real dimensions; with 
16-QAM and rate-3/4 turbo code, this system offers a high 
spectral efficiency of 48 bps/Hz. Decoding of non-orthogonal 
STBCs with such large dimensions, however, has been a chal- 
lenge. Sphere decoder and its low-complexity variants are 
prohibitively complex for decoding such STBCs with hun- 
dreds of dimensions. Recently, we proposed a low-complexity 
near-ML achieving algorithm to decode large non-orthogonal 
STBCs from CDA; this algorithm, which is based on bit- 
flipping approach, is termed as likelihood ascent search (LAS) 
algorithm fSj-lEl. Our new contribution in this paper is that 
we present a belief propagation (BP) based approach to de- 
coding of non-orthogonal STBCs with large dimensions, and 
report very good uncoded BER and near-capacity performance. 
To our knowledge, BP has not been reported for decoding 
non-orthogonal STBCs from CDA having large dimensions. 

Belief propagation [SI is known to be well suited in several 
communication problems ||9l; e.g., decoding of turbo codes 
and LDPC codes |[TOl,|[ni, multiuser detection Ell,!!!, sig- 
nal detection in ISI channels |[T4l,|[l5l, and MIMO detection 
ifTSl . lfTTl . Taking the cue from the success of BP in decoding 
turbo codes and achieving near-capacity performance with 



large frame sizes, in this paper we achieve near-ML and near- 
capacity performance in decoding STBCs with large dimen- 
sions using BP at practical complexities. It is known that 
graphical models that represent MIMO systems are highly 
connected. While BP was initially formalized for loop-free 
graphs, it has been empirically found to work in loopy graphs 
as weU |[lOl,|[T8l. In the context of MIMO detection using 
BP, fl6l| reported a successful adoption of BP algorithm on 
Markov random fields (MRF) by employing belief propaga- 
tion meant for pairwise MRFs as described in [HI. It pre- 
sented the BER performance of the BP based detector for V- 
BLAST with iVt = 4 and iV^ = 4, 6, 8 without and with sim- 
ulated annealing. However, BP approach applied to large di- 
mension decoding in MIMO systems, particularly large non- 
orthogonal STBC MIMO systems, has not been reported so 
far In this regard, our work here is the first in reporting BP on 
MRFs for large dimension STBC MIMO decoding. Our sim- 
ulation results show that the proposed BP decoding algorithm 
performs close to within just about 2.5 dB from the theoretical 
MIMO capacity in a 24 x 24 non-orthogonal STBC MIMO 
system using BPSK and rate- 1/2 turbo code at a spectral ef- 
ficiency of 12 bps/Hz. We also present the performance of 
BP decoding in the presence of spatial correlation. We show 
that the loss in performance due to spatial correlation can be 
alleviated by using increased receive spatial dimensions. 

II. Non-Orthogonal STBC MIMO System Model 

Consider a STBC MIMO system with multiple transmit and 
multiple receive antennas. An {n,p, k) STBC is represented 
by a matrix Xc G C"^'', where n and p denote the number 
of transmit antennas and number of time slots, respectively, 
and k denotes the number of complex data symbols sent in 
one STBC matrix. The (i,j)th entry in Xc represents the 
complex number transmitted from the zth transmit antenna in 
the jth time slot. The rate of an STBC is |. Let Nr and 
Nt ~ n denote the number of receive and transmit antennas, 
respectively. Let He G C^''^^' denote the channel gain ma- 
trix, where the (i, j)th entry in He is the complex channel 
gain from the jth transmit antenna to the ith receive antenna. 
We assume that the channel gains remain constant over one 
STBC matrix duration. Assuming rich scattering, we model 
the entries of He as i.i.d CA/'(0, 1). The received space-time 
signal matrix. Ye G C^'^^^, can be written as 

Ye = HeXe+Ne, (1) 

where Ne G C^'^^^ is the noise matrix at the receiver and 
its entries are modeled as i.i.d CA/'(0, cr^ = ^YiS^), where Es 
is the average energy of the transmitted symbols, and 7 is the 
average received SNR per receive antenna HI, and the (i, j)th 
entry in Ye is the received signal at the ith receive antenna in 
the jth time slot. Consider linear dispersion STBCs, where 
Xe can be written in the form |[T| 



where Xc ^ is the ith complex data symbol, and Ac*' G C.^*^p 
is its weight matrix. The received signal model in ([TJ can be 
written in an equivalent V-BLAST form as 

k 

= ^4''(Hea«) + ne - HcXe + n,, (3) 

i=l 

where e c'^'''"'' = «ec(Ye), He £ C^''''^^"' = (I ® 
He), a<'^ G c^tpxi ^ „ec(A^'^), G C^-p'^^ = i;ec(Nc), 
Xc G C''^^ whose zth entry is the data symbol x'c \ and He G 
i^N^pxk ^jjosg jfjj column is He ai*', i = 1, 2, • ■ • , fc. For 
notational simplicity, we drop the subscripts c in (O and write 
y = Hx + n, (4) 

where x = Xc, H = H, y = y^, and n = ric. We note that 
(|4|i can be viewed as an equivalent V-BLAST representation 
of the non-orthogonal STBC MIMO system. We assume that 
the channel coefficients are known at the receiver but not at 
the transmitter. 



A. High-Rate Non-Orthogonal STBCs from CDA 
We consider square (i.e., n=p=Nt), full-rate (i.e., k~pn~ 
Nf), circulant (where the weight matrices Ac''''s are permu- 
tation type), non-orthogonal STBCs from CDA f2l|, whose 
construction for arbitrary number of transmit antennas n is 
given by the matrix in (4. a) given at the bottom of this page 
IJl- In (4. a), LOn ~ , j — V— 1, and du,v, < u,v < ii — 1 
are the data symbols from a QAM alphabet. When S ~ 
t — 1, the code in (4. a) is information lossless (ILL), and 
when d = and t = e^, it is of full-diversity and infor- 
mation lossless (FD-ILL) IJ]- High spectral efficiencies with 
large Nt can be achieved using this code construction; e.g., 
16 X 16 STBC from (4.a) using 4-QAM and rate-3/4 turbo 
code offers a spectral efficiency of 24 bps/Hz along with the 
full-diversity of order NfN.r under ML detection. However, 
since these STBCs are non-orthogonal, ML detection gets in- 
creasingly impractical for large Nf. Hence, a key challenge in 
realizing the benefits of these large non-orthogonal STBCs in 
practice is that of achieving near-ML performance for large 
Nt at low decoding complexities. The BP based decoding 
approach in this paper essentially addresses this issue. 

III. BP ON MRFs FOR Large Dimension MIMO 
Detection/Decoding 
In this section, we present the BP detection algorithm assum- 
ing a V-BLAST system with BPSK modulation. The algo- 
rithm can be applied on the equivalent V-BLAST representa- 
tion of the non-orthogonal STBC MIMO system given by (|4|l. 
BP is a technique that solves inference problems using graph- 
ical models such as factor graphs, Bayesian belief networks 





Fig. 1 

Markov random field representation of a BPSK V-BLAST 

SYSTEM WITH 4 TRANSMIT ANTENNAS . 

and MRFs. In this paper, we consider BP on MRF represen- 
tation of MIMO systems. 

A. MRF Representation of a MIMO System 

MRFs are graphs that indicate inter-dependencies between 
random variables ||9l. An MRF is an undirected graph whose 
vertices are random variables. The variables are such that any 
variable is independent of all the other variables, given its 
neighbors, i.e., 

p {xk\xi, . . . ,Xk-i,Xk+i, ■ . ■ ,xn) ^p[xk\J^{xk)], (5) 

where J\f (xk) represents the set of all nodes neighboring the 
node pertaining to the variable Xk ■ Usually, the variables in an 
MRF are constrained by a compatibility function, also known 
as a clique potential in literature. A clique of an MRF is 
a fully connected sub-graph that does not remain fully con- 
nected if any additional vertex of the MRF is included in it. 
This is sometimes called a maximal clique, but we shall use 
the term clique to refer to a maximal clique. Let there be Nc 
cliques in the MRF, and be the set of variables in clique j. 
Let ipj (xj) be the clique potential of clique j. Then the joint 
distribution of the variables is 

Nc 

p(x) oc Jl (xj), (6) 
i=i 

where x is the set of all variables in the graph. For exam- 
ple, consider the MRF representation of a BPSK V-BLAST 
system with 4 transmit antennas as shown in Fig. [T] Here, 
xi,X2,X3,X4 respectively are the binary symbols transmit- 
ted from the four transmit antennas. Each of these symbols 
assumes a value from {±1}. In a V-BLAST system, since ev- 
ery transmitted symbol interferes with every other transmit- 
ted symbol at the receiver, the MRF is fully-connected, and 
contains a single clique, namely, {xi, X2,X3, X4}. The joint 
probabiUty distribution is 

p(a;i,a::2,a;3,X4) = p (a;i)p (a;2|a::i)p (xslsi, 0:2) 

p(x4|a;i,a::2,a;3) . (7) 

B. Pairwise MRFs 
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Fig. 2 

An example of a pairwise Markov random field with 
observed (explicit) variables and hidden (implicit) variables. 

MRF are of size two. In this case, the clique potentials are all 
functions of two variables. The clique potentials can then be 
denoted as tpij {xi, x/), where variables connected 

by an edge in the MRF. 

Consider a pairwise MRF in which the Xi's denote underly- 
ing hidden variables on which the observed variables j/^'s are 
dependent 0| . Let the dependence between the hidden vari- 
able Xi and the explicit variable i/i be represented by a joint 
compatibility function <j>i {xi,yi). This scenario is shown in 
Fig. [21 In such a scenario, the joint distribution of the hidden 
and explicit variables is 



(8) 



C. Belief Propagation on Pairwise MRFs 
Consider a situation similar to that of Fig. |2] where .x^'s are 
the hidden variables and y/s are the observed variables. If 
we consider the y^'s to be fixed and write (f)i {xi, yi) simply 
as (f)i (xi), then, from the joint distribution for the hidden 
variables can be written as ID 



p(x) oc HV'ij (a;i,a;j)II(?!)i (a;^) 



(9) 



A message from node j to node i denoted as rrij^i (xi), and 
belief at node i denoted as bi{xi) are vectors of length equal 
to the number of values that the discrete variable Xi can pos- 
sibly take (e.g., length of message and belief vectors is 2 in 
BPSK since Xi € {±1})- Each element of the belief vector 
is proportional to how likely the corresponding value of Xi 
was transmitted. On the other hand, each element in the mes- 
sage vector mji{xi) is proportional to how likely xj thinks 
the corresponding value of xi has been transmitted. The be- 
Uef at node i about the state of Xi is 

hi (xi) (X (pi (xi) H nij^i {xi) . (10) 

In particular, the messages are defined as fS) 

mj^i(xi) (x^<j>j {xj)il!j^i{xj,Xi) H mk,j (xj) (11) 
Xj keN'U)\i 

Equation ( fTTl i actually constitutes an iteration, as the mes- 
sage is defined in terms of the other messages. Therefore, 
belief propagation essentially involves computing the outgo- 
ing messages from a node to each of its neighbors using the 
local joint-compatibility function and the incoming messages 
and transmitting them. 
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Fig. 3 

Message passing in an MRF for a BPSK V-BLAST system with 
4 transmit antennas. 

D. BP for MIMO Detection 

In this subsection, for the MIMO system defined by an MRF 
(e.g., as shown in the graph in Fig. [T]), we present the BP 
based detection algorithm. We observe that the MRFs for 
MIMO systems, exemplified in Fig. [l] are not pairwise MRFs. 
However, application of the BP algorithm for pairwise MRFs, 
as outlined by (O and ([TT), to the MRFs of MIMO sys- 
tems yields a low-complexity detector for MIMO systems 
with large dimension^ The joint probability distribution for 
hidden variables Xi in a pairwise MRF is given by (|9]l. We 
use the following functions (ja and ipij for MIMO detection 
using BP: 

(/),; (x,) = exp [K (xf z,) + In {p (x,)}] , (12) 
ipij (xiyXj) = 3? (cxp [~-xf RijXj)) , (13) 
where |[T6l 

(14) 



2 (15) 

Then, the message from node i to a neighboring node j in the 
BP algorithm is given by (fTTI) as given in ||8l. For a BPSK V- 
BLAST system with 4 transmit antennas, the message pass- 
ing scheme is summarized in Fig. [3] Nodes pass messages 
to each other in an iterative fashion. After the last iteration, 
beliefs are calculated locally at each node. The value of Xi 
which has the maximum belief is selected as the symbol de- 
cision. The belief is the soft-output of the detector which can 
be fed to the decoder in a coded system. 

E. Computational Complexity 

The complexity of the detection scheme comprises of three 
components, namely, i) computation of channel correlation 

^Our simulation results show that near SISO AWGN uncoded BER per- 
formance and near-capacity coded BER performance are achieved in large 
dimension V-BLAST as well as non-orthogonal STBC MIMO systems by 
this approximate approach. 



R given by (fTSl l and matched filter output z given by (fT4] i. 
i«) computation of local evidence and compatibility func- 
tion tjj in (O and (fTsl i. respectively, and iii) calculation of 
beliefs and messages during iterative message passing given 
by ( [Tol l and ( fTTl i. respectively. The computation of R and z 
involves the computation of H^H and H^y, respectively. 
In case of STBC MIMO system, two good properties of the 
STBCs from CDA are instrumental in achieving low orders 
of complexity for the computation of H^H and H^y. They 
are: i) the weight matrices Ai*''s are permutation type, and 
ii) the x matrix formed with Nf x 1-sized ai*' vec- 
tors as columns is a scaled unitary matrix. For Nt ~ Nr, 
the computation of H^y and hence z can be done in 0{N^) 
complexity, i.e., in 0{Nf) per-symbol complexity since there 
are Nf symbols in one STBC matrix. Likewise, the computa- 
tion of H^H and hence R can be done in 0{Nf) per-symbol 
complexity. In case of V-BLAST, R and z can be computed 
in 0{Nf) and 0{Nt) per-symbol complexity, respectively. 
Computation of ?/' involves 0{N^) and 0{Nt) per-symbol 
complexity for STBC and V-BLAST, respectively. The per- 
symbol complexity of computing is 0(1). The per-symbol 
complexities involved in the computation of messages and 
beliefs in a single iteration for V-BLAST are 0{Nf) and 
0{Nt), respectively; for STBC, these complexities are of or- 
der 0{p^N^) and 0{pNt), respectively. 

IV. Simulation Results 
Our simulation results have shown that the proposed BP based 
algorithm achieves increasingly closer to SISO AWGN un- 
coded BER performance for V-BLAST signals with increas- 
ing number of dimensions (e.g., performance close to within 
1 dB of 10^'^ uncoded BER for hundreds of dimensions). 
Since the dimensions in V-BLAST are in space alone, sys- 
tems with hundreds of antennas may not be realistic. On the 
other hand, use of non-orthogonal STBCs from CDA can cre- 
ate hundreds of dimensions with just tens of antennas (space) 
and tens of channel uses (time). In this section, we present 
the uncoded and coded BER performance of the proposed BP 
algorithm in decoding large non-orthogonal STBCs. 5 BP it- 
erations are used in all the simulations. 

A. Uncoded BER performance of large STBCs from CDA: 
BP decoding achieves near SISO AWGN performance for large 
STBCs: In Fig. 21 we plot the uncoded BER performance 
of 8 X 8 (64 dimensions), 16 x 16 (256 dimensions) and 
24 x 24 (576 dimensions) non-orthogonal STBCs from CDA 
for BPSK and Nt = AV, as a function of average received 
SNR per receive antenna, 7 Ql- BER plots for STBCs with 
ILL (i.e., 5 = t = 1) and FD-ILL (i.e., J = e^J, i = eJ) are 
shown. For reference purposes, we have plotted the BPSK 
BER performance on a SISO AWGN channel as well as on 
a SISO flat-fading channel. From Fig. 21 the following two 
interesting observations can be made: 

. BERs of both ILL and FD-ILL STBCs with BP decoding 
improve and approach SISO AWGN performance as the 
number of dimensions (i.e., N^) is increased. For e.g., 
the performance of 24 x 24 FD-ILL STBC is just about 
1.5 dB away from SISO AWGN performance at 10"^ 
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Uncoded BER of BP decoding of 8 x 8, 16 x 16 and 24 x 24 

NON-ORTHOGONAL STBCS. ILL (5 = t = 1) AND FD-ILL 
(5 = e^\t = ej) STBCs. Nt = Nr, BPSK. 5 BP ITERATIONS. BP 
decoding achieves near SISO AWGN performance for large sized STBCs. 

BER. This is due to the inherent ability of BP to perform 
well in large systems. 
. With the proposed BP decoding, the BER of ILL STBC 
is worse than the BER of FD-ILL STBC. This perfor- 
mance gap between ILL and FD-ILL STBCs diminishes 
with increasing Nt, indicating that for large Nt = Nr 
ILL feature of the STBC with (5 = t = 1 is good enough. 

Turbo coded BER performance: Figure |5] shows the turbo 
coded BER performance of the BP detector in a STBC MIMO 
system with BPSK and 24 x 24 FD-ILL STBC (i.e., 5 e^J, 
t = eJ) and Nt = Nr = 24. Rate-1/3 and rate-1/2 turbo 
codes achieving 8 bps/Hz and 12 bps/Hz spectral efficien- 
cies, respectively, are used. The theoretical minimum SNRs 
required to achieve these 8 and 12 bps/Hz capacities in a 
Nt ~ Nr = 24 MIMO channel, as computed from the er- 
godic capacity formula H], are also shown in Fig. |5] From 
Fig. 121 it can be seen that the vertical fall in coded BER oc- 
curs only about 2.5 dB away from the theoretical minimum 
SNRs, which is very good in terms of nearness to capacity. 
Such nearness results for large STBCs from CDA using BP 
decoding have not been reported so far 

Effect of Spatial Correlation: In generating the BER results 
in Figs. 21 and |5l we have assumed i.i.d. fading. How- 
ever, MIMO propagation conditions witnessed in practice of- 
ten render the i.i.d. fading model as inadequate. More realis- 
tic MIMO channel models that take into account the scatter- 
ing environment, spatial correlation, etc., have been investi- 
gated in the literature lfT9l - ll2TI . For example, spatial corre- 
lation at the transmit and/or receive side can affect the rank 
structure of the MIMO channel resulting in degraded MIMO 
capacity ||T9l . The structure of scattering in the propagation 
environment can also affect the capacity |20l. Hence, it is 
of interest to investigate the performance of the proposed BP 
decoder in more realistic MIMO channel models. Towards 
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Turbo coded BER of BP decoding of 24 x 24 non-orthogonal 

FD-ILL STBC {S = e^i,t = fj). Nt = Nr = 24, BPSK, TURBO 
CODE RATES: RATE-1/2 (12 BPS/HZ) AND RATE-1/3 (8 BPS/HZ). 5 BP 
ITERATIONS. BP decoding achieves near-capacity performance to within 
about 2.5 dBfrom capacity. 



this end, in this subsection, we adopt the correlated MIMO 
channel model in 1211 . which incorporates the single spatial 
correlation parameter, r, presented in 1221 . to a matrix chan- 
nel model. Figure |6] shows the simulated uncoded BER per- 
formance of 16 X 16 FD-ILL STBC with BPSK, Nt = 16, 
Nr = 16,17 for the correlation channel model in 1211 with 
r ~ 0.12. Performance of the same with i.i.d fading and 
Nt = Nr ^ 16 is also plotted for comparison. From Fig. 
|6l it can be observed that compared to i.i.d fading, there is a 
loss in diversity order in spatial correlation for Nt = Nr = 16; 
further, use of more receive antennas {Nr = 17, Nt = 16) alle- 
viates this loss in performance. We note that the proposed BP 
based decoding can be used to decode perfect codes l23l . ll24ll 
of large dimensions as well. 

V. Conclusion 

We presented a low-complexity decoding scheme based on 
BP to decode non-orthogonal STBCs from CDA having large 
dimensions. The proposed BP scheme involved message pass- 
ing on Markov random field representation of the STBC MIMO 
system. Successful application of BP for detection in large 
non-orthogonal STBC MIIVIO systems has not been reported 
so far. Simulation results showed that the BP approach to 
large STBC decoding is quite effective, achieving near-ML 
and near-capacity performance in STBCs with large dimen- 
sions. Effect of spatial correlation on the performance of 
the proposed BP decoding was presented. Extension of the 
proposed BP approach to higher order modulation (e.g.,M - 
PAM/il/-QAM) in the large STBC MIMO context can be in- 
vestigated as further extension to this work. 
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